13 research outputs found

    MetAssimulo:Simulation of Realistic NMR Metabolic Profiles

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Probing the complex fusion of genetic and environmental interactions, metabolic profiling (or metabolomics/metabonomics), the study of small molecules involved in metabolic reactions, is a rapidly expanding 'omics' field. A major technique for capturing metabolite data is <sup>1</sup>H-NMR spectroscopy and this yields highly complex profiles that require sophisticated statistical analysis methods. However, experimental data is difficult to control and expensive to obtain. Thus data simulation is a productive route to aid algorithm development.</p> <p>Results</p> <p>MetAssimulo is a MATLAB-based package that has been developed to simulate <sup>1</sup>H-NMR spectra of complex mixtures such as metabolic profiles. Drawing data from a metabolite standard spectral database in conjunction with concentration information input by the user or constructed automatically from the Human Metabolome Database, MetAssimulo is able to create realistic metabolic profiles containing large numbers of metabolites with a range of user-defined properties. Current features include the simulation of two groups ('case' and 'control') specified by means and standard deviations of concentrations for each metabolite. The software enables addition of spectral noise with a realistic autocorrelation structure at user controllable levels. A crucial feature of the algorithm is its ability to simulate both intra- and inter-metabolite correlations, the analysis of which is fundamental to many techniques in the field. Further, MetAssimulo is able to simulate shifts in NMR peak positions that result from matrix effects such as pH differences which are often observed in metabolic NMR spectra and pose serious challenges for statistical algorithms.</p> <p>Conclusions</p> <p>No other software is currently able to simulate NMR metabolic profiles with such complexity and flexibility. This paper describes the algorithm behind MetAssimulo and demonstrates how it can be used to simulate realistic NMR metabolic profiles with which to develop and test new data analysis techniques. MetAssimulo is freely available for academic use at <url>http://cisbic.bioinformatics.ic.ac.uk/metassimulo/</url>.</p

    Multiple-testing correction in metabolome-wide association studies.

    Get PDF
    BACKGROUND: The search for statistically significant relationships between molecular markers and outcomes is challenging when dealing with high-dimensional, noisy and collinear multivariate omics data, such as metabolomic profiles. Permutation procedures allow for the estimation of adjusted significance levels without assuming independence among metabolomic variables. Nevertheless, the complex non-normal structure of metabolic profiles and outcomes may bias the permutation results leading to overly conservative threshold estimates i.e. lower than those from a Bonferroni or Sidak correction. METHODS: Within a univariate permutation procedure we employ parametric simulation methods based on the multivariate (log-)Normal distribution to obtain adjusted significance levels which are consistent across different outcomes while effectively controlling the type I error rate. Next, we derive an alternative closed-form expression for the estimation of the number of non-redundant metabolic variates based on the spectral decomposition of their correlation matrix. The performance of the method is tested for different model parametrizations and across a wide range of correlation levels of the variates using synthetic and real data sets. RESULTS: Both the permutation-based formulation and the more practical closed form expression are found to give an effective indication of the number of independent metabolic effects exhibited by the system, while guaranteeing that the derived adjusted threshold is stable across outcome measures with diverse properties

    Correlation Network Analysis reveals a sequential reorganization of metabolic and transcriptional states during germination and gene-metabolite relationships in developing seedlings of Arabidopsis

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Holistic profiling and systems biology studies of nutrient availability are providing more and more insight into the mechanisms by which gene expression responds to diverse nutrients and metabolites. Less is known about the mechanisms by which gene expression is affected by endogenous metabolites, which can change dramatically during development. Multivariate statistics and correlation network analysis approaches were applied to non-targeted profiling data to investigate transcriptional and metabolic states and to identify metabolites potentially influencing gene expression during the heterotrophic to autotrophic transition of seedling establishment.</p> <p>Results</p> <p>Microarray-based transcript profiles were obtained from extracts of Arabidopsis seeds or seedlings harvested from imbibition to eight days-old. <sup>1</sup>H-NMR metabolite profiles were obtained for corresponding samples. Analysis of transcript data revealed high differential gene expression through seedling emergence followed by a period of less change. Differential gene expression increased gradually to day 8, and showed two days, 5 and 7, with a very high proportion of up-regulated genes, including transcription factor/signaling genes. Network cartography using spring embedding revealed two primary clusters of highly correlated metabolites, which appear to reflect temporally distinct metabolic states. Principle Component Analyses of both sets of profiling data produced a chronological spread of time points, which would be expected of a developmental series. The network cartography of the transcript data produced two distinct clusters comprising days 0 to 2 and days 3 to 8, whereas the corresponding analysis of metabolite data revealed a shift of day 2 into the day 3 to 8 group. A metabolite and transcript pair-wise correlation analysis encompassing all time points gave a set of 237 highly significant correlations. Of 129 genes correlated to sucrose, 44 of them were known to be sucrose responsive including a number of transcription factors.</p> <p>Conclusions</p> <p>Microarray analysis during germination and establishment revealed major transitions in transcriptional activity at time points potentially associated with developmental transitions. Network cartography using spring-embedding indicate that a shift in the state of nutritionally important metabolites precedes a major shift in the transcriptional state going from germination to seedling emergence. Pair-wise linear correlations of transcript and metabolite levels identified many genes known to be influenced by metabolites, and provided other targets to investigate metabolite regulation of gene expression during seedling establishment.</p

    Piecewise multivariate modelling of sequential metabolic profiling data

    No full text
    Abstract Background Modelling the time-related behaviour of biological systems is essential for understanding their dynamic responses to perturbations. In metabolic profiling studies, the sampling rate and number of sampling points are often restricted due to experimental and biological constraints. Results A supervised multivariate modelling approach with the objective to model the time-related variation in the data for short and sparsely sampled time-series is described. A set of piecewise Orthogonal Projections to Latent Structures (OPLS) models are estimated, describing changes between successive time points. The individual OPLS models are linear, but the piecewise combination of several models accommodates modelling and prediction of changes which are non-linear with respect to the time course. We demonstrate the method on both simulated and metabolic profiling data, illustrating how time related changes are successfully modelled and predicted. Conclusion The proposed method is effective for modelling and prediction of short and multivariate time series data. A key advantage of the method is model transparency, allowing easy interpretation of time-related variation in the data. The method provides a competitive complement to commonly applied multivariate methods such as OPLS and Principal Component Analysis (PCA) for modelling and analysis of short time-series data.</p

    Muscle function, quality and relative mass are associated with knee pain trajectory over 10.7 years

    Get PDF
    Periarticular muscle plays an important role in the pathogenesis of musculoskeletal pain. Werecently reported that pain population consists of distinct subgroups of which the causes andmechanisms may differ. This study aimed to examine the association of lean mass, musclestrength and quality with 10.7-year pain trajectory. 947 participants from a population-basedcohort study were analysed. Dual-energy X-ray absorptiometry was used to assess lean andfat mass. Leg, knee extensor strength and lower-limb muscle quality weremeasured/calculated. Knee pain was assessed by the Western Ontario and McMaster Universities Osteoarthritis Index pain questionnaire. Radiographic knee osteoarthritis (ROA)was assessed by X-ray. Three distinct pain trajectories were identified: “Minimal pain” (53%),“Mild pain” (34%) and “Moderate pain” (13%). Higher total and lower-limb lean mass wereassociated with an increased risk of “Mild pain” and “Moderate pain” trajectories relative tothe “Minimal pain” trajectory group, but these associations became non-significant afterfurther adjustment for fat mass. Total lean mass percentage was associated with a lower riskof “Mild pain” [relative risk ratio (RRR): 0.95, 95%CI 0.92-0.98] and “Moderate pain”trajectory (RRR:0.92, 95%CI 0.87-0.96). Greater leg and knee extensor strength and musclequality were associated with “Mild pain” and “Moderate pain” trajectories (RRR:0.52-0.65,all P<0.05). Similar results were found in those with ROA. Higher lower-limb musclestrength and quality, and relative lean mass, are associated with a reduced risk of severe kneepain trajectories, suggesting that improving muscle function and composition may protectagainst persistent unfavourable knee pain courses
    corecore